
Check if connection is valid while getting a client from the pool #1957


Closed
xizheyin wants to merge 2 commits

Conversation

xizheyin (Contributor) commented

Fixes #1887

It is possible that a client connection in the pool has not been used for a long time and the server has automatically closed it; however, is_closed only checks the state of the local client, not whether the server side still holds the connection. I added the check to improve robustness.
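A minimal sketch of the added check, assuming a tokio_postgres-backed pool (the SELECT 1 probe is my choice; only is_closed and validate_connection come from the actual change being discussed):

```rust
use tokio_postgres::Client;

/// Sketch of the added check: `is_closed()` only reflects local state,
/// so also make one round trip to confirm the server still answers.
async fn validate_connection(client: &Client) -> bool {
    if client.is_closed() {
        return false;
    }
    // Fails if the server has silently dropped the connection.
    client.simple_query("SELECT 1").await.is_ok()
}
```

On checkout from the pool, a client failing this check would be discarded and replaced with a freshly opened connection.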

Alternatively, there are other ways to keep the connections alive more reliably, such as a periodic keepalive (sketched below).
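For illustration only (not part of this PR, and the interval is arbitrary), such a keepalive could ping idle pooled connections so server-side idle timeouts never fire:

```rust
use std::time::Duration;
use tokio_postgres::Client;

/// Hypothetical keepalive: ping an idle pooled connection on an interval
/// so the server never closes it as idle.
async fn keepalive(client: &Client) {
    let mut interval = tokio::time::interval(Duration::from_secs(30));
    loop {
        interval.tick().await;
        if client.simple_query("SELECT 1").await.is_err() {
            // Connection is dead; stop pinging and let the pool drop it.
            break;
        }
    }
}
```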

Urgau (Member) commented May 19, 2025

I know next to nothing about our database connections; maybe @ehuss or @Kobzol knows about it, and whether this is conceptually a good idea.

ehuss (Contributor) commented May 19, 2025

Just quickly looking at this, it seems like it might have a few issues:

  • It doesn't look like it manages the permit. That means the permit count and the pool count could get out of sync, allowing more connections than the semaphore is intended to allow.
  • It doesn't directly address #1887 (Scheduled jobs often fail due to db timeout), which is more of a theory about idle connections being terminated, and about not knowing that until the job has been popped off the queue.
  • If the connection is broken, checking it in get() isn't going to help, since the call to validate_connection could itself hang, which would be no different from just running a normal query (see the sketch after this list).
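
To make the hang concern concrete: any validation in get() would at least need a timeout bound. A sketch under that assumption (the two-second limit is arbitrary, and this still does nothing for the permit accounting above):

```rust
use std::time::Duration;
use tokio_postgres::Client;

/// Hypothetical timeout-guarded validation: without the bound, a broken
/// connection would make get() hang exactly like a normal query would.
async fn validate_with_timeout(client: &Client) -> bool {
    tokio::time::timeout(Duration::from_secs(2), client.simple_query("SELECT 1"))
        .await                  // Err(Elapsed) if the server never answers
        .map(|res| res.is_ok()) // otherwise, the query's own result
        .unwrap_or(false)       // a timeout counts as a broken connection
}
```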

When I have more time, I can maybe look at some suggestions. Or maybe @Kobzol has more time right now.

Kobzol (Contributor) commented May 20, 2025

Tbh I don't think this is the right approach, as we aren't even sure what the cause of the original issue was. These timeouts also happen in rustc-perf, and IMO they are caused simply by the DB not responding because it is overloaded, or perhaps by a network issue. The DB server has been beefed up recently, so maybe these issues aren't even happening anymore; they seem to have stopped happening on rustc-perf once the server was upgraded. So I would keep the connection pooling as it is for now.

xizheyin (Contributor, Author) commented

OK. I opened this because I have encountered similar problems before, but there may be various reasons behind the original issue. Thank you all three for the review! I'll close it.

xizheyin closed this May 20, 2025